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METHOD AND APPARATUS FOR QUEUE MANAGEMENT 

PRIORITY 

This application claims priority from United States Provisional Application 
60/137,082 entitled "Apparatus and Method of Design and Configuration of Active 
Queue Management in Routers and Switches" filed on June 2, 1999 which is 
incorporated by reference herein in its entirety. 

FIELD OF THE INVENTION 

The invention generally relates to networks and, more particularly, the invention 
relates to the management of a queue at a node in a network. 

BACKGROUND OF THE INVENTION 

Congestion occurs in a network when resource demands exceed capacity. In 
prior art communications networks, resource demands exceed capacity when data is sent 
on a path from a sender to a recipient and a node on the path cannot send data as quickly 
as it is received. In this case, the throughput of the node decreases and may drop to zero. 
When the throughput drops at the node, received packets build up in the node's 
memory, referred to as a buffer, increasing the number of accumulated packets forming a 
queue until the buffer is full and overflows. As the buffer overflows, data at the receiver 
may be delayed or the data may be lost. Such a state is generally a transient condition in 
a network as users of the network vie for resources during peak time periods. In the past, 
nodes in high-speed networks have been forced to include large buffers in an attempt to 
avoid overflow during periods of congestion. As a result of increasing buffer size, 
defined as accumulated packets waiting to be serviced, the average queue size increases. 
The average queue size for a buffer is the average number of packets present in the 
buffer. 

One technique for avoiding large queues and large network delays is Random 
Early Detection (RED). RED is designed to accompany transport-layer congestion 
control protocols such as TCP and operates as a mechanism for regulating the amount of 
information that is sent to a node by decreasing the number of acknowledgment packets 
that are sent to the sender. The congestion control mechanism in TCP is a closed control 
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system that reacts to uUcknowledged packets by re-sending the^Thacknowledged 
packets and reducing the transmission rate. Systems that implement RED detect 
congestion by computing the average queue size as data is received into a buffer. When 
the average queue size exceeds a preset threshold, the node refuses to service i.e. "drops" 
5 a percentage of packets as determined by a control function. 

The queue sizes determined by the RED technique, in combination with TCP 
congestion control, are subject to large size oscillations. Using the RED technique, 
parameters defining the control function are set by a system's administrator, without a 
methodology for determining values for the parameters. As such, the control function 

10 may be unstable and fail to adequately regulate the feedback to the TCP congestion 
control. The large oscillations that result under such circumstances in one node can 
propagate to other nodes and cause erratic behavior in the network. Because RED does 
not define a methodology for calculating the parameters, system administrators have 
used trial and error techniques. These trial and error techniques do not provide for a 

15 controllable network. 

SUMMARY OF THE INVENTION 

method, apparatus, and computer program product for determining a drop 
probability^: use in a congestion control module located in a node in a network are 
disclosed. A werght value for determining a weighted moving average of a queue in a 
20 node is first systematically calculated. The weighted moving average is calculating and 
an average queue size fortr^e node is determined based upon the weighted moving 
average. A control function associated with the congestion control module is evaluated 
using the average queue size to determine the drop probability. The weight value may be 
calculated by first determining a sampling period for measuring the queue size. Next, a 
25 time period for which samples significantly contribute to the average queue size is 
calculated. The weight is determined based uporKlhe sampling period and the time 
period. In a further embodiment, the control function^ is calculated based upon a queue 
function where the queue function is calculated based ujK>n predetermined system 
parameters. The control function may be selected based upKm a queue policy for 
3t> management of the queue. From the queue policy a threshold\alue which lies along the 
queue function curve is determined. This point provides a minimum value for the 
maximum point of the control function so as to avoid oscillations within the buffer. A 
maximum point may then be selected which resides outside of the curvs for the queue 
function. The control function may then be selected so that the control function crosses 



through the^aximurf^fent. Thus, when the congestion contl^r^odule drops packets 
based upon the drbo probability determined by the control function the queue will not 
oscillate. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects and advantages of the invention will be 
appreciated more fully from the following further description thereof with reference to 
the accompanying drawings wherein: 

Fig. 1 shows a schematic drawing of a communications network in which the 
apparatus and method for queue management may be implemented. 

Fig. 2 is a flow chart for determining the steady state operating point of a queue. 

Fig. 3 shows a simplified network, which is used to simulate a more complex 
communications network. 

Fig. 4 is a block diagram showing another simplification of the network where 
the network is reduced to a single-flow feedback system. 

Fig. 5 is a sample plot of the queue law function. 

Fig. 6 is a graphical representation of the queue law function and the control 
function in which the axes are the average queue size and the drop percentage. 
Fig. 7 is a graphical representation of a control function. 

Fig. 8 is a graphical representation showing two control functions superimposed 
upon the queue law function. 

Fig. 9 is a graphical representation of the ranges that the queue law function may 
take based upon the values for the number of flows, the size of a packet, and the round 
trip transmission time. 

Fig. 10 shows a flow chart of the steps used by either a designer of a congestion 
control module or of a system administrator setting the parameters for defining the 
control function of the congestion control module. 

Fig. 1 1 is a flow chart showing the steps taken in calculating the point (qmax, 

Pmax)- 

Fig. 12 is a flow chart showing the steps taken in calculating the minimum buffer 

size. 

Fig. 13 is a block diagram of another embodiment of a congestion control module 
containing a processor and a queue estimator. 

Fig. 14 is a flow chart of the method used in calculating the value of the 
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Fig. 15 is a graphical representation of the sending rate verses time for an 
exemplary communications network. 

Fig. 16 is an alternative embodiment of a congestion control module in which the 
queue law is used for long term adjustment of the drop probability. 

Fig. 17A shows an illustrative configuration module. 

Fig. 17B shows a control function module that resides within a congestion 
control module. 

. Fig. 18 A shows a weight calculation module. 

Fig. 18B shows a congestion control module which is similar to the congestion 
control module of Fig. 13, but Fig. 18B also shows the data which is necessary for 
configuring the congestion control module prior to operation. 



Fig. 1 shows a schematic drawing of a communications network in which the 
apparatus and method for queue management may be implemented. The communications 
network, such as the Internet, connects computers and other processing devices to other 
computers, servers, and processing devices via nodes so that data may be sent, received, 
and retrieved. All devices that forward and filter data, such as, but not limited to, 
routers, bridges, b-routers, switches, repeaters and gateways will be referred to as nodes 
in the following description and claims and all connections between such nodes shall be 
referred to as links. 

In the following description and appended claims the term "flow" shall refer to a 
signal representative of data sent from a sending node to a receiving node. In the 
description and the claims the term "capacity" refers to the total amount of data that a 
link can process in a given time and is equivalent to line speed. 

Nodes within the network each have at least one ingress and one egress port. A 
node may receive multiple flows of data, usually in the form of packets, into one or more 
of the ingress ports where each flow is stored and queued in a buffer prior to routing the 
data flow to an egress port in the node. As more information flows into the buffer, the 
queue becomes larger until the capacity of the buffer is reached and then the subsequent 
data is lost. To prevent buffer overflow, the buffer of each node is regulated by a node 
congestion control module that operates in conjunction with an end-system congestion 
control module. The node congestion control module regulates the average queue size by 
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idM^node that congestion is occurring at tnPre 



indicating to the sendM^node that congestion is occurring at fnWeceiving node. The 
end-system congestion control module refers to the part of the transport layer protocol 
that is responsible for adjusting the sending rate at the sending node. For example, in a 
network employing TCP as the transport layer protocol, the node congestion control 
5 module drops acknowledgement packets to indicate congestion and the end-system 

congestion control module decreases the sending rate in response. In another example, 
the node congestion control module may send an acknowledgement packet from the 
receiver indicating that congestion is occurring and again the end-system congestion 
control module decreases the sending rate at the sending node. It will be understood by 
10 those of ordinary skill in the art, that any end-system congestion control module may be 
used which causes a sending node to adjust its sending rate as the result of congestion. 
Additionally, the node congestion control module would be designed to work in 
conjunction with the chosen end-system congestion control module. For the remainder of 
this disclosure, the end-system congestion control module will refer to an embodiment in 
15 which TCP is the transport layer protocol. This is done for exemplary purposes and is in 
no way meant to limit the scope of the disclosed invention. 

In a TCP environment, when packets of data are dropped, the node does not send 
□ an acknowledgment packet to the sending node. As a result, the sending node slows 

ij down its sending rate, which has the effect of decreasing the size of the average queue in 

i; 20 the buffer of the receiving node. As stated before, the average queue size is the average 
3 amount of data found in the node's buffer. The average queue size shall be represented 

by q. The interaction between the node congestion control and the end-system congestion 
control has the effect of stabilizing the node's queue size and the rate it drops packets. 
The values of average queue size and drop probability in such a stable or steady state are 
25 referred to collectively together as a steady-state operating point. 

In a node that incorporates a congestion control module and implements one 
embodiment of the invention for queue management, the steady-state operating point can 
be calculated based on the solution for a system of two equations. The two equations 
being a queue law function and a control function. Fig. 2 shows a flow chart for 
30 determining the steady-state operating point. First the queue law is evaluated (step 210). 
This first function represents a characterization of the end-system's congestion control . 
In other words, the queue law function is an approximation of the average queue size of 
an exemplary queue in a node based upon the traffic characteristics and the percentage of 
dropped packets. For example, one embodiment of the queue law equation that will be 
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further explained and^^ved below can be: 
G{p) = min(5,c(7- R -'(p.c/n)- R^) 
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Round trip transmission time outside the queue 
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Next the control function is determined (Step 220). The control function is 
characteristic of the node congestion control module. It determines the drop percentage 
when the buffer of the node is filled above a predetermined threshold based upon an 

10 average queue size as an input parameter. The control function may be any type of 

function, but is preferably a linear function. The point of intersection between the queue 
law function and the control function determines a value to which the drop percentage 
should be set and the control module uses the drop percentage to drop packets evenly 
across all flows (step 230). 

15 The queue law may be used in variety of ways for the design, configuration, and 

operation of a congestion control module. In one embodiment, the congestion control 
module may be enabled to calculate both the queue law and the control function based 
upon input data concerning traffic characteristics, and determine the point of intersection 
of the functions thus determining the packet drop rate. In another embodiment, the queue 

20 law may be used to model the queue variation for a node over all drop probabilities 

based upon maximum and minimum traffic conditions. This modeling of the queue can 
aid a system administrator in determining configuration parameters for the control 
function of the congestion control module of a node in a network in order to avoid 
oscillations in the queue. In such an embodiment, a predefined control function exists in 

25 the congestion control module that has a definable shape, and the queue law provides 
information for configuring the shape so that the system remains stable. In yet another 
embodiment, the modeling of the queue law based upon the expected maximum and 
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minimum traffic con^Wons can be used to define a range of opBfation for a node. The 
ranges of operation can then be used in the design of a node to determine the minimum 
buffer size. 

The derivation of the queue law function is shown in the next two figures. Fig. 3 
shows a network that is used as a model for determining the queue law for a complex 
communications network. The network of Fig. 3 is a simplification of a real network 
such as the Internet. In this simplified network, the transport layer protocol is TCP. The 
simplified network is used to determine a queue law function for a node based upon 
known traffic characteristics for the node, such as, the line speed, number of TCP flows 
and propagation delay. The queue law function determines an average queue size for a 
buffer based upon a drop percentage, where the drop percentage is an independent 
variable. 

In the simplified network, a node congestion control module is situated in node 
B. The communications system acts as a feedback control system in which the control 
module is the sending node, the controlling element is the node congestion control 
module, the feedback signal is the drop percentage, and the controlled variable is the 
sending rate. A total of n flows, from node Ai through node A n , flow into node B, which 
contains a node congestion control module. Node B is connected to node C through a 
single link. 

One function/operation of the node control module is to maintain the cumulative 
data rate of all flows below or equal to the link's capacity so that the rate of the 
combined flows from node B, i.e. the throughput, is less than or equal to the capacity of 
the link between B and C. The throughput of a TCP flow is dependent on the drop 
percentage p, the average round trip time R, the average packet size M, the average 
number of packets acknowledged in one acknowledgement message b, the maximum 
congestion window size advertised by the flow's TCP receiver Wmax, and the duration of 
the basic TCP timeout T 0 . The throughput of a TCP flow is approximately equal to 
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which was derived in Padhye, Firoiu, Towsley and Kurose A Stochastic Model of TCP 
RENO congestion Avoidance and Control Technical Report CMPSCI TR-99-02 
University of Massachusetts, Amherst 1999 which is incorporated by reference herein in 
its entirety. 

For discussion purposes, the assumption is made that all links into node B and 
out of node C have enough capacity. Accordingly, the link between B and C is the only 
link where congestion can occur. All n flows combine at the buffer of node B and each 
flow provides digital data that adds to the queue. One simplification of the model 
network that is used to determine the queue law function assumes that there is only one 
link between two nodes. Additionally, packets of data are assumed to flow in only one 
direction. In Fig. 3 data flows from nodes Ai_ n to nodes Di_ n . The only data that flows in 
the other direction is assumed to be acknowledgment packets (ACKs). Further, the 
number of flows into node B is assumed to remains constant. Assuming that all flows 
have the same average round trip time and the same average packet size and that the 
maximum congestion window size is so large that it does not influence the throughput, 
the network may be reduced to a single- flow feedback system as shown in Fig. 4. 

Ba^ed on aforementioned assumptions, the queue law =G(p) is determined 
(where is the average size of the queue and p is the drop probability). Since the link 
between A and B is the^only possible point of congestion, the average round trip time of 
a packet is the sum of the average waiting time in the queue of node B and Ro, the 
propagation and the round-trip transmission time outside the node. Assuming a FIFO 
(First and First Out) queuing schemes^he average waiting time in the queue is qlc and 
the overall round trip time for a packet istte sum of the average waiting time in the 
queue and the propagation and transmission time on the rest of the round trip so that 
R=Ro+g/c. 

Regulating the queue due to congestion is only relevant when the link is fully 
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utilized, since the aviJ^ queue size is small otherwise. The^^f utilization can be 
determined through the following calculation: 



u(p) = 



T(p,R 0 ) 
cl n 



The link utilization u(p) takes values between 0 (not utilized) and 1 (fully utilized). If it 
is determined that the link is fully utilized, the queue law function for providing the 
average queue size is: 

G(p) = c{T- l (p y c/n)-R 0 ) 

where c is equal to the line speed and T R " l (p,T) is the inverse of T(p,R) in R, i.e., 

Tr"'(p,T(p,R)) = R. Since the average queue size cannot exceed the size of the buffer 
B, the queue law is: 

G(p) = max(s, c(t~ 1 (p, c I n) - R 0 )) 



A sample plot of the queue law function is provided in Fig. 5. It should be 
understood by those skilled in the art that additional simplifications may be made that 
result in different equations for the utilization and the average queue size. For example, 
if the system is assumed to be free of TCP timeouts, the throughput T(p,R) can be 
approximated as 



R \2bp 



and thus it can be easily shown that: 



( 



G(p) = max 



2bp 



-cR n 



if the link is fully utilized, i.e., if u(p)=l where 

TheTesuJting queue law equation can be programmed into a processor or programmed in 
software to be exTeeuted or embedded into an electronic chip associated with the node 
congestion control module^Tbr-automatic calculation during operation of the node based 
upon traffic conditions. Traffic cono^tloBajnclude the line speed c and the round trip 
time R, the number of flows, n, and the throughpo^or all variables necessary to calculate 
the throughput. For example, the minimum and maximurrK^qughput per flow (t^ t^) 
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which ^or a dial-up m^fen in a wide area network is 28.8Kb^^o^ x^ and 56Kb/s for 
T min however-tttespeed of x^ and x^ the connection is implementation dependent. 
Additional traffic characteristics include the minimum and maximum packet sizes (Mmin, 
Mmax) the minimum and max imbqv round trip time outside of the queue (Romin, R Omax) the 
minimum and maximum drop probability outside of the queue (pomin> Pomax) ar| d the 
minimum and maximum TCP receiver windb^CCW^) (W^) ^ ). 

Fig. 6 shows a graphical representation of the queue law function and the control 
function in which the axes on which the functions are graphed are the average queue size 
(Y- axis) and the drop percentage (X-axis). The two functions are each dependent on 
one variable that is the product of the other function and as a result, both functions 
intersect at a point. This intersection point is the expected equilibrium for the node based 
on the initial traffic conditions. The equilibrium point provides the drop percentage for 
the congestion control module so that data packets may be dropped as the data enters the 
buffer of the node to regulate the queue size. By dropping packets based upon a 
percentage of packets entering the node, each flow is proportionally affected regardless 
of the sending rate of the particular flow. 

Fig. 7 is a graphical representation of the control function H(q) = p. In an 
exemplary embodiment, the control function is composed of two linear segments. The 
first linear segment, segment A, defines the expected operational range of the average 
queue size. The segment is selected to be linear for ease of design and management as a 
control system, although a non-linear segment can be substituted. The maximum point of 
the first segment defines the maximum expected values for the average queue size and 
the corresponding drop rate (gmax* Pmax) when the node is under normal operating 
conditions, i.e. there is not an unexpected change in any of the traffic conditions. For 
example, if a node normally handles up to 1000 simultaneous flows, and traffic 
conditions change producing 100,000 flows, such a scenario would cause the node to 
operate in overload outside of the normal operating conditions. The second linear 
segment, segment B, defines the overload operation range for the average queue size. 
This segment is preferably designed with a steeper slope as shown in Fig. 7,, but does not 
have an infinite slope, as an infinite slope creates a discontinuity and possible 
uncontrollable oscillations to the average queue size. The steeper slope of segment B 
allows the queue to degrade gracefully to its normal operational range along Segment A. 
Segment B is defined by points (tfmax, Pmax) and Oyclip, 1) where <yclip is defined, as 
shown in Fig. 7, as the average queue size at which there is a one hundred percent drop 
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i^fcood that segment B, like segment A^^ry 



rate. It should be una^tood that segment B, like segment A, may be non- linear in 
shape. 

Although the intersection point is the point of convergence for the average queue 
size, the communications network may encounter various changes that affect the queue 
5 law and cause fluctuations affecting the average queue size of the node. These transient 
fluctuations are normal in complex communications networks and the queue average will 
eventually return to the equilibrium point if the system is stable. If the maximum value 
of the control function on segment A (<7max> pmax) lies below the equilibrium point and 
thus inside of the queue law function, the transient behavior of the system is unstable 
10 causing the queue size to fluctuate in an oscillatory pattern. The average queue size can 
oscillate between empty and full buffer, causing the packet drop percentage to vary 
between zero and 100%. As packets are dropped and the sending rate of the sending 
node varies, a similar state can occur in the queue of the sending node. As such, the 
oscillations may reverberate throughout the communications network. 
15 Fig. 8 shows two control functions superimposed upon the queue law function. In 

ijl control function A the expected maximum value (pmax, <?max) for the control function lies 

y above the equilibrium point and therefore outside of the queue law function, allowing the 

*3 transient response of the system to decay so that the oscillations of the system dissipate 

fy and the system returns to the equilibrium point. The maximum value (pmax, <7max) of 

s fl 

: = 20 control function B is shown to be below the equilibrium point and therefore inside the 
j 5 queue law function which results in the average queue size fluctuating causing 

oscillations in the queue and throughout the communications network. 

It should be understood by those skilled in the art that the queue law can have 
various curves depending on the input values of n, the number of flows, M, the average 
25 packet size, and R, the average round trip time. As shown in Fig. 9 by the shaded region, 
the queue law may take on any value within this region dependent upon the values for n, 
M, and R. If the control function is originally set to control function A pictured in Fig. 9 
and the queue law curve is originally Gmin, the system is stable at initialization and large 
fluctuations do not occur in the average queue size. If the parameters n, M, and R change 
30 to their maximum, maximum, and minimum values respectively during the course of 
operation of the node, the queue law curve would then become Gmax, and the control 
function A would have transients which would cause average queue size fluctuation. If in 
contrast, the control function is set initially to control function B of Fig. 9, no matter how 
n, M, and R vary over the course of operation of the node, the equilibrium point is 
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in^fe the queue law curve and all transien^fe: 



always maintained inMe the queue law curve and all transieritTOissipate and the system 
settles at the equilibrium. 

Fig. 10 shows a flow chart of the steps used by either a designer of a congestion 
control module or by a system administrator setting the parameters for defining the 
5 control function of the congestion control module. First, the value of Gma* is calculated 
(step 1000). Using the calculated value Gmax, a designer or system administrator of a 
congestion control module for a node, can design the control function so as to avoid 
oscillations by designating the maximum point of the control function, as defined by 
(pmax, <7maxX outside the maximum queue law curve Gmax (step 1010). The designer or 
10 system administrator can then define the control function to be any function with 

expected operating range ending at (pmax, <?max) outside of Gmax (step 1020). The function 
may already be defined as in a congestion control module which uses RED, and as such a 
system administrator only needs to input parameter values including (pmax, <7max) so that 
[ J ! step 1020 is unnecessary. The designer or system administrator may add a further 

IS 15 segment to the control function which is defined between (pmax, <?max) and the maximum 
■5i buffer size q c \i P which also defines the point at which there is a 100 percent packet drop 

rate. 

□ It should be understood that a computer program can be created for a node 

! i! 

I Vj designer for determining allowable values for the maximum point of the control function 

|lj 20 as defined by the intersection of the queue law function and the control function based 

□ upon expected ranges of traffic conditions for the node. Further this program need not be 
part of a congestion control module and may be used offline during the design phase of a 
node. 

The actual value for (^max, pmax) as explained above may be any value that lies 
25 outside of the queue law function. However, to fix a value for (<7max, pmax), one of the 
two variables must be known and the other variable calculated as shown in the flow 
chart of Fig. 11. Determining (#max, Pmax) is accomplished by first selecting a policy for 
queue management. For example, the policy can be a drop conservative or delay 
conservative policy, but other policies can be used (step 1 100). If the drop conservative 
30 policy is selected, pmax is set by the designer or system administrator (step 1110). If the 
delay conservative policy is selected dmax, the maximum allowable delay, is set by the 
designer or system administrator (step 1115). For a drop conservative policy, given pmax 
, the condition to avoid oscillations becomes >G max (p ma x) providing a formula for 
fixing ^max (step 1 120). Since ^max defines the maximum average queue size, this 
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threshold may be incraRed by 20% to account any spikes in the^verage buffer size so 
that ^max>l '2G max (p ni ax)- It should be understood by those of ordinary skill in the art, that 
the 20% added is used for exemplary purposes and that any value greater than Gmax(Pmax) 
may be used for determining 4^ and avoiding perturbations. As such, the maximum 
5 value for the control function can be set. For a delay conservative policy, a value for d^ 
is given, thus ^max=d max c (step 1 125). Finally, p,^ may be determined by applying 
Pmax > G 'max^max) (step 1 130). As with the drop conservative policy, oscillations may be 
avoided by adding in a factor of 20% such that pmax =1.2 G^maxtemax). Again this 20% is 
used for exemplary purposes and any value greater than the threshold of G^maxOjmax) for 
10 calculating p^ may be used. 

The queue law function can also be used to determine the minimum needed 
buffer size, B for operation in a network environment having a defined set of maximum 
traffic conditions in a fashion similar to the method described with respect to Fig. 11. 
Fig. 12 shows a flow chart of the steps taken in calculating the minimum buffer size. 
15 First, a maximum traffic condition is selected (step 1200). The maximum traffic 

condition is selected based upon the desired policy for queue management. If a drop 
conservative policy is chosen, a low drop probability is desired and the maximum drop 
percentage pmax is known. If a delay conservative policy is chosen, a low average queue 
sy size is desired and a value for the maximum allowable delay dma* is known. The 

20 maximum queue law function Gmax is used to determine the minimum buffer size qmix 

based upon the known traffic condition (step 1210). For the drop cons, policy and for the 
delay conservative #max=dmax c and pmax > G^maxtemax)- The buffer size of the planned 
node is set to any value equal to or above the value for the maximum average queue size, 
?max (step 1220). In a preferred embodiment the buffer size is substantially larger than 
25 the maximum average queue size by an order of two in order to account for fluctuations. 
In another embodiment of the invention for operation in a node in a 
communications network, the congestion control module contains a processor to 
implement a control function and a queue estimator that computes the average queue 
size, which is shown in Fig. 13. A congestion control module receives as input the 
30 average queue size and outputs the drop percentage p. The node congestion control 

module of Fig. 13 implements a feedback node control function p=H(<? e ) for adjusting 
the average queue size of the node, where q c is an estimate of the long-term average of 
the queue size for a node. In such an embodiment the queue law may be used to 
determine the shape of the control function as explained with respect to Fig. 10 and 1 1 
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>in^^size as explained with respect to Fi^^. 



and the minimum buror size as explained with respect to Fig^R. The parameters of the 
control function are preferably set in the node congestion control module by a system 
administrator. For example, if the congestion control module is a RED implementation, 
the values of (^min, Pmin) (<7max, Pmax) and ^ C H P are input to define the control function. The 
5 control function may have any shape and is not limited to the shape as defined by the 
RED implementation. 

The congestion control module receives data flows from a plurality of sources 
into an input. From the input, the data is placed into a buffer. As the buffer is filled with 
data, a queue is created. The position of the data within the buffer is tracked by a pointer 
10 that is controlled by a processor. The average queue size is calculated by a queue 
estimator and is then used to determine the drop percentage based upon the control 
function. The drop percentage is then used to drop packets within the queue, which 
slows down the sending rate evenly across all flows and as a result adjusts the average 
queue size. 



IB 15 The queue estimator is essentially a low-pass filter on the instantaneous queue 

|T| size, which filters out brief queue changes and estimates the long-term queue average. In 



a preferred embodiment, an exponentially weighted moving average is used to calculate 
'3 the average queue size. The average queue size is determined by first taking samples of 

! ; y the queue size every 6 seconds over a given time period, I, and exponentially weighting 

'.U 20 the averaging process by a weight factor, w. The average in the exponentially weighted 
j 3 moving average is calculated recursively based on previous average and a new sample 

q k so that q M = wq k +(l-w)<y k . To calculate the exponentially weighted moving average a 
value for the weight, w is necessary. The value for w is determined based upon the 
values of 5 and I. 

25 Fig. 14 shows a flow chart of the method used in calculating the value of the 

weight w. First, a value for 6 is determined. It, 6, can be as small as possible, but is 
limited by implementation constraints. (Step 1400). The value of 6 should be at most 
Romin, which is the minimum round trip time for a packet, since the throughput of a TCP 
flow can only change rates based on a packet being sent and an acknowledgment either 

30 being received or not being received. As such, the queue size will not change with any 
significance in less than a round trip. A value defining when a sample's contribution is 
negligible is determined (Step 1410). In an exponentially weighted moving average, the 
contribution of each sample decreases exponentially with each passing time interval 8 
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Therefore, the numberof samples which contribute with any significance to the average 
may be calculated assuming a weight smaller than a value, a, which defines the point at 
which the sample's contribution is negligible. The value of a is between 0 and 1 and is 
typically assumed to be 0.1 or 1/e = 0.367. Knowing that, the number of samples, wi, that 
contribute with any significance to the average may be calculated: 



3 



„,_ ln(a) 

m — t r 

In(l-w) 

where the total time interval for all of the samples, I, may be substituted based on the fact 



that I=mS , and therefore, w=l-a 



6/1 



10 \ value for I is then calculated (STEP 1420). The value of I is determined as the 

result of two opposing conditions for affecting the exponentially weighted moving 
average. The^irst condition is to provide an acceptable approximation of the long-term 
average of the system assuming a constant number of flows, wherein the approximation 
accurately accounts\for small perturbations in the instantaneous average. The second 
15 opposing condition fo\the queue averaging algorithm is the ability of the approximation 
to account quickly for changes to the system such as a substantial increase in the number 
of flows using the following assumptions of the network, which provides a compromise 
between the two conditions i\found. In a communications network that has n flows 
flowing into a node which have\the same average round trip time, the throughput of each 
P 20 flow is T=c/n and each flow has the same drop rate p. Additionally, it is assumed that the 
network implements a TCP congestion control algorithm in which the sending rate 
decreases when a packet is not acknowledged. In such a network, the sending rate over 
\ time is linearly increasing for a flow until\a packet is dropped, when the rate is then 
decreased in half as shown in Fig. 15. The period of this function is denoted by P. The 
25 variation in sending rate is reflected in a si mi Far variation in the queue size and thus the 
queue size has the same periodicity P. If the averaging interval of the moving average is 
equal to the period, then the average is equal to th\ long term average and the value does 
not change when the interval is translated in time. IfNthe interval is smaller than the 
period I<P, then the average is no longer constant. For^P, the moving average has a 
30 small variation, but converges rapidly to the long-term average. 

P can then be determined as a function of average drop rate p and average round 
trip time R: 
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Tuffo,)*,)* Q{pMp)Mp% ~ wM<w 

"k 



P = A(p,R) = 

8 



+ ^ Vjr> 013x7 V1W 0 otherwise 
l-p 



Since P is a function of p and R, P can take multiple values depending on traffic 
conditions. For the purpose of ensuring the stability of the control system, I is taken to 
be the maximum value of P over all traffic conditions n, M, R, Wmax, within the 
5 predefined range n^, n^, M min , M^, Rmin, Rmax, W^n^. I is chosen to be 

the maximum value of P so that the smaller intervals which produce averages with high 
variation can be accounted for. I is computed as follows: 

/ = max(A(p 1 , R Qlwx +q x Ic\ A{p 2 , R 0 max + q 2 I c)) 

where A is defined above and 
10 (pi^i) is the solution to 

in q\ = GminCpi) 

m pi = H(^0 and 

jti (P2,qi) is the solution to 

^ qi - G max (p 2 ) 

□ 15 p 2 =H(^ 2 ) 

!=y The weight can then be calculated such that w=l-a (Step 1430). From the value of the 

weight w, the long term average of the queue size can be calculated using the 
Q exponentially weighted moving average described above. The long term average queue 

size is then passed from the queue estimator to the processor, which uses the queue size 

20 to determine the drop probability for the node. 

In yet another embodiment as shown in Fig. 16, in which a queue estimator and 
control function are used to provide congestion control in real-time for a node, the queue 
law may be used to periodically correct the location of the queue in the buffer. In such an 
embodiment, the queue law function is calculated on a periodic basis in a queue law 

25 module based upon input traffic conditions, which are sensed by sensors attached to the 
node housing the congestion control module. The queue law module may be part of a 
processor for calculating the control function or a separate processor. From the queue 
law function, the expected operation point may be calculated by finding the intersection 
of the queue law function with that of the control function of the congestion control 

30 module. The operation point, can then be compared to either the instantaneous queue 
size or an estimated moving average based upon instantaneous queue samples. If the 
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instantaneous queue^ffe or the estimated moving average is larger than the estimated 
queue size, the drop probability is increased to reduce the sending rate and if the queue 
size is smaller than the estimated queue size the drop probability is decreased so that the 
sending rate increases. In such an embodiment, the average queue size is adjusted to the 
5 estimated value the queue law function provides a means for compensating for drift in 
the queue, which may result from the queue estimator. 

It should be understood by those of ordinary skill in the art that the method for 
defining the control function and thus (pmax, <?max)> the method for defining the minimum 
buffer size and the method for determining the weight may be combined for determining 
10 RED parameters in a systematic manner. These RED parameters may then be entered by 
a system administrator for use in a RED congestion control module. Further, if the RED 
congestion control module requires a value for 6 the method for calculating 6 may also 
,5 be part of the method for systematically determining the RED parameters. 

Fig. 17A shows an illustrative configuration module in which system parameters 
15 as defined above are input into a configuration module and the disclosed methods for 
determining (pmax, <?max ) and the buffer size are implemented. The output of the module 
which is control function configuration parameters may then be used by a system 
administrator to configure a control function module which resides within a congestion 
ij control module as shown in Fig. 17B. The control function configuration parameters of 

U 20 the buffer size and the maximum point of the control function (pmax, <?max ) are used in 
3 combination with the desired queue policy to define the control function. Once the 

control function is defined the control function module can receive the average queue 
size as input and will then output the drop probability for the node containing the control 
module. Fig. 18A shows a weight calculation module that receives as input system 
25 parameters and at an intermediary stage determines the sampling period and the number 
of samples that significantly contribute to the average queue size. This weight calculation 
module may be part of the congestion control module or a completely independent 
module. The weight calculation module then uses the number of samples and the 
sampling period to determine a weighting value, which is then used for estimating the 
30 average queue size. Fig. 18B shows a congestion control module which is similar to the 
congestion control module of Fig. 13, but Fig. 18B also shows the data which is 
necessary for configuring the congestion control module prior to operation. As shown in 
Fig. 18B the weight is necessary for the queue estimator and the control function 
configuration parameters are necessary for the configuration of the control function 
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which is part of the^Ttrol function module. 

In an alternative embodiment, the disclosed method and apparatus for queue 
management may be implemented as a computer program product for use with a 
computer system. Such implementation may include a series of computer instructions 
fixed either on a tangible medium, such as a computer readable medium {e.g., a diskette, 
CD-ROM, ROM, or fixed disk) or transmittable to a computer system, via a modem or 
other interface device, such as a communications adapter connected to a network over a 
medium. The medium may be either a tangible medium (e.g., optical or analog 
communications lines) or a medium implemented with wireless techniques (e.g., 
microwave, infrared or other transmission techniques). The series of computer 
instructions embodies all or part of the functionality previously described herein with 
respect to the system. Those skilled in the art should appreciate that such computer 
instructions can be written in a number of programming languages for use with many 
computer architectures or operating systems. Furthermore, such instructions may be 
stored in any memory device, such as semiconductor, magnetic, optical or other memory 
devices, and may be transmitted using any communications technology, such as optical, 
infrared, microwave, or other transmission technologies. It is expected that such a 
computer program product may be distributed as a removable medium with 
accompanying printed or electronic documentation (e.g., shrink wrapped software), 
preloaded with a computer system (e.g., on system ROM or fixed disk), or distributed 
from a server or electronic bulletin board over the network (e.g., the Internet or World 
Wide Web). Of course, some embodiments of the invention may be implemented as a 
combination of both software (e.g., a computer program product) and hardware. Still 
other embodiments of the invention are implemented as entirely hardware, or entirely 
software (e.g., a computer program product). 

Although various exemplary embodiments of the invention have been disclosed, 
it should be apparent to those skilled in the art that various changes and modifications 
can be made which will achieve some of the advantages of the invention without 
departing from the true scope of the invention. These and other obvious modifications 
are intended to be covered by the appended claims. 
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